<i>Rich Languages from Poor Inputs</i>
Similar resources
Knowledge-poor Approach to Constructing Word Frequency Lists, with Examples from Romance Languages
Word frequency lists extracted from documents are widely used in many text clustering and categorization procedures. Such lists are usually compiled with morphology-based approaches (such as the Porter stemmer) that join words sharing the same base meaning. However, such an approach needs many language-dependent linguistic resources or knowledge when working with multilingual da...
Improved Statistical Machine Translation for Resource-Poor Languages Using Related Resource-Rich Languages
We propose a novel language-independent approach for improving statistical machine translation for resource-poor languages by exploiting their similarity to resource-rich ones. More precisely, we improve the translation from a resource-poor source language X1 into a resource-rich language Y given a bi-text containing a limited number of parallel sentences for X1-Y and a larger bi-text for X2-Y fo...
Building Text-to-Speech Systems for Resource Poor Languages
The focus of this research is to develop a method for building text-to-speech (TTS) systems for resource-poor languages by using data from other languages to fine-tune a general template polyglot TTS architecture. Our method involves three main components: language clustering, phoneme mappings and prosody modelling. As a proof of concept, four TTS systems have been implemented for English, Spanish, Malay and...
Contrastive Learning of Emoji-based Representations for Resource-Poor Languages
The introduction of emojis (or emoticons) on social media platforms has given users an increased potential for expression. We propose a novel method called Classification of Emojis using Siamese Network Architecture (CESNA) to learn emoji-based representations of resource-poor languages by jointly training them with resource-rich languages using a siamese network. The CESNA model consists of tw...
Journal
Journal title: ENGLISH LINGUISTICS
Year: 2017
ISSN: 0918-3701,1884-3107
DOI: 10.9793/elsj.33.2_616